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Abstract 

Syntrophic acetate-oxidizing bacteria have been identified as key organisms for 
efficient biogas production from protein-rich materials. They normally grow as 
lithotrophs or heterotrophs, producing acetate through the Wood-Ljungdahl 
pathway, but when growing in syntrophy with methanogens, they reportedly 
reverse this pathway and oxidize acetate to hydrogen and carbon dioxide. How- 
ever, the biochemical and regulatory mechanisms behind the shift and the way 
in which the bacteria regain energy remain unknown. In a genome-walking 
approach, starting with degenerated primers, we identified those gene clusters 
in Syntrophaceticus schinkii, Clostridium ultunense, and Tepidanaerobacter acetat- 
oxydans that comprise the formyltetrahydrofolate synthetase gene (fhs), encod- 
ing a key enzyme of the Wood-Ljungdahl pathway. We also discovered that 
the latter two harbor two fhs alleles. The fhs genes are phylogenetically sepa- 
rated and in the case of S. schinkii functionally linked to sulfate reducers. The 
T. acetatoxydans fhsl cluster combines features of acetogens, sulfate reducers, 
and carbon monoxide oxidizers and is organized as a putative operon. The 
r. acetatoxydans fhsl cluster encodes Wood-Ljungdahl pathway enzymes, which 
are also known to be involved in CI carbon metabolism. Isolation of the enzymes 
illustrated that both formyltetrahydrofolate synthetases of T. acetatoxydans were 
functionally active. However, only fhsl was expressed, confirming bidirectional 
usage of the pathway. 



Introduction 

Syntrophic acetate oxidation (SAO) has been shown to 
occur in anaerobic environments such as lake sediments 
(Nusslein et al. 2001), oil reservoirs (Nazina et aL 2006), 
and nutrient-enriched soQs (Chauhan and Ogram 2006). 
Moreover, SAO has been shown to occur in biogas pro- 
cesses (Zinder and Koch 1984; Schnurer et al. 1999; 
Karakashev et al. 2005, 2006; Schnurer and Nordberg 
2008). In general, methane in biogas processes is consid- 
ered to mainly result from the action of aceticlastic metha- 
nogens (Zinder 1984). However, with increasing ammonia 
levels, released during the degradation of protein-rich 
materials, this group of organisms is inhibited and acetate 
is instead oxidized to H2 and CO2 by syntrophic acetate- 
oxidizing bacteria (SAOB), thermodynamically driven by 



the H2 consumption of hydrogenotrophic methanogens 
generating methane (Schnurer and Nordberg 2008; 
Westerholm et al. 2012). Other factors such as acetate 
concentration, operational parameters, and microbial 
community structures have also been considered to influ- 
ence the acetate conversion pathway (Karakashev et al. 
2005, 2006). In addition to previous investigations, a 
recent study using samples from nine randomly selected 
large-scale digesters in Sweden revealed that methane is 
generated through SAO rather than aceticlastic methano- 
genesis (unpublished). Thus, SAO appears to be of much 
greater importance for the biogas process than assumed 
previously and therefore aceticlastic methanogenesis can 
no longer be considered the main pathway. 

Decades of research have focused on aceticlastic 
methanogens and have neglected SAOB. Therefore, only a 
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few species of SAOB have been isolated and little is 
known about their physiology and biochemistry. To date, 
four isolates belonging to the physiological group of acet- 
ogens have been characterized and three of these, intro- 
duced as Clostridium ultunense (Schniirer et al. 1996), 
Tepidanaerobacter acetatoxydans (Westerholm et al. 
2011b), and Syntrophaceticus schinkii (Westerholm et al. 
2010), were isolated in our laboratory. The fourth species, 
the thermophilic Thermacetogenium phaeum, was isolated 
by Hattori et al. (2000). It has been suggested that the 
mechanism for converting acetate syntrophically occurs 
by an oxidative Wood-Ljungdahl (W-L) pathway (Lee 
and Zinder 1988; Schniirer et al. 1997; Hattori et al. 
2005), as used by certain sulfate reducers and aceticlastic 
methanogens (Fig. 1). However, in this case, the process 
is highly endergonic (AG°' = +95 kj/mol) when using 



protons as electron acceptor. It has been postulated that 
the immediate consumption of hydrogen gas by the 
methanogenic partner keeps the hydrogen partial pressure 
low enough to make the reaction sufficiently exergonic 
(AG°' = -36 kJ/mol) and allows ATP synthesis (Schink 
1997). The W-L pathway, used by SAOB in a reductive 
way when growing heterotrophically, can be viewed as a 
series of reactions resulting in the reduction of two mole- 
cules of CO2 to a bound methyl and carbonyl group, 
which finally form the acetyl moiety of acetyl-CoA 
(Fig. 1). It is currently unknown whether there are any 
differences between the pathways operating in one direc- 
tion (forming acetate) and the other (consuming acetate). 
Moreover, other acetogens such as Moorella thermoacetica, 
Thermoanaerobacter kivui, or Acetobacterium woodii seem 
unable to use this pathway in an oxidative way and to 




Figure 1. The Wood-Ljungdahl pathway used by acetogens, sulfate-reducing bacteria (SRB), aceticlastic methanogens, and syntrophic acetate- 
oxidizing bacteria (SAOB). In the methyl branch, free CO2 becomes reduced by formate dehydrogenase (1) to formate, which is subsequently 
attached to tetrahydrofolate in an ATP-dependent reaction catalyzed by W'°-formyltetrahydrofolate synthetase (2). The A/'^-tetrahydrofolate- 
bound formyl group is further reduced via W^,A/'''-methenyl- and A/^,W^°-methylene- to W^-methyl- by the activities of N^,N^°- 
methenyltetrahydrofolate cyclohydrolase (3), A/^,W^°-methylenetetrahydrofolate dehydrogenase (4), and W^,W'°-methylenetetrahydrofolate 
reductase (5). The methyl group is transferred by a methyltransferase (6) to a corrinoid enzyme (7), which interacts with the carbonyl branch. 
Within the carbonyl branch, a second CO2 molecule is reduced to an enzyme-bound carbonyl group by a bifunctional enzyme consisting of a 
carbon monoxide dehydrogenase subunit (CODH) (8), and an acetyl-CoA synthase subunit (ACS). The ACS (9) function finally produces an acetyl 
moiety from the bound methyl and carbonyl groups, which is subsequently attached to Coenzyme A. Acetyl-CoA can either be incorporated into 
cell material or converted to acetate via phosphotransacetylase (10) and acetate kinase (11) forming an ATP. CM, cytoplasmic membrane. 
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grow syntrophically with hydrogenotrophic methanogens 
(Winter and Wolfe 1980; Zinder and Koch 1984; Cord- 
Ruwisch et al. 1988). Thus, SAO seems not to be a com- 
mon physiological feature of acetogens. 

To further understand the activity of SAOB in different 
environments, it is critical to understand the respective 
biochemical and regulatory mechanisms behind acetate 
production and acetate oxidation. To start with, we 
identified the fhs genes of T. acetatoxydans, S. schinkii, and 
C. ultunense encoding formyltetrahydrofolate synthetase, a 
key enzyme of the W-L pathway, by developing a new 
degenerated primer pair. We then analyzed the gene struc- 
ture adjacent to the fhs genes by genome walking and inves- 
tigated fhs mRNA expression in a syntrophic coculture. We 
also cloned two of the/?js genes and determined the specific 
activities of the gene products after purification. Thus, this 
article presents the first steps toward understanding the 
thermodynamically unique SAO pathway. 

Experimental Procedures 

Bacterial strains and growing conditions 

Clostridium ultunense strain Esp (JCM 16670), S. schinkii 
strain Sp3 (DSM 21860), and T. acetatoxydans strain Rel 
(DSM 21804) were cultivated at 37°C in basal medium 
modified as described by Schnurer et al. (1996) in the 
presence of 10 mmol/L lactate, 10 mmol/L betaine, and 
10 mmol/L glucose, respectively. A coculture consisting of 
C. ultunense, S. schinkii, T. acetatoxydans, and the hydro- 
genotrophic Methanoculleus sp. MABl was cultivated in 
the same medium supplemented with 10 mmol/L acetate 
and 0.2 mol/L NH4CI. Thermacetogenium phaeum strain 
PB (DSM 12270) was obtained from Deutsche Sammlung 
von Mikroorganismen und Zellkulturen (DSMZ) (Braun- 
schweig, Germany) and cultivated as recommended by 
DSMZ. The C. ultunense strain BST deposited in the 
DSMZ has been identified as impure and was therefore 
replaced in this study by strain Esp, newly isolated in our 
laboratory (Westerholm et al. 2010). The Escherichia coli 
strains JM109 (Promega, Madison, WI) and BL21 (DE3) 
(Novagen, Merck KGaA, Darmstadt, Germany) were 
grown at 37°C in Luria-Bertani (LB) medium. 

DNA isolation 

Genomic DNA was isolated and purified using the 
DNeasy Blood & Tissue Kit from Qiagen (Hilden, 
Germany) and concentrated by salt/ethanol precipitation 
according to Sambrook et al. (1989). DNA purity was 
confirmed by amplifying and sequencing the respective 
16SRNA gene using PuReTaq Ready-to-go PGR beads 
(GE Healthcare, Buckinghamshire, U.K.) and the universal 



primer pair EC9-26f (GAGTTTGATCMTGGCTCA, modi- 
fied "fD2" and 926r; Weisburg et al 1991; Jernberg and 
Jansson 2002). To check purity more sensitively, the 
amplified 16sRNA gene in the case of T. acetatoxydans 
and C. ultunense was purified (QIAquick PGR Purification 
Kit, Qiagen), cloned into the pGEMTeasy vector (Pro- 
mega), and analyzed by colony PGR following the manu- 
facturer's instructions using E. coli JM109 as host strain. A 
total of 40 clones each were screened for alien DNA by 
sequencing 16SRNA. Plasmids were isolated from LB- 
grown cultures using the QIAprep Spin Miniprep Kit 
from Qiagen according to the manufacturer's instructions. 

RNA isolation 

RNA was isolated from exponentially growing pure cul- 
tures or cocultures using the RNeasy Mini Kit from Qia- 
gen according to the manufacturer's instructions with the 
following modification: Gells were disrupted by bead beat- 
ing (speed 5.5 for 40 sec) using a homogenizer and glass 
beads from the FastDNA soil kit from MP Biomedicals 
(France). Genomic DNA was removed using DNasel from 
Fermentas (Germany), as suggested by the manual. Com- 
plete DNA digest was checked by PGR using PuReTaq 
Ready-to-go PGR beads and gene-specific primers as listed 
in Table 1 (application: mRNA expression). PGR included 
an initial denaturation at 95°G for 3 min, 35 cycles of 
denaturation at 95°G for 30 sec, annealing at 60°G for 
30 sec, and elongation at 72°G for 40 sec, followed by a 
final elongation step at 72°G for 7 min. 

mRNA expression studies 

DNAsel-treated RNA samples were reverse transcribed 
using RevertAid Premium Reverse Transcriptase and ran- 
dom hexamer primers from Fermentas as recommended. 
cDNA were used directly in PGR as recommended, 
applying same conditions and primers as mentioned 
above. 

Construction of DNA libraries 

Four different DNA libraries each of C. ultunense, 
S. schinkii, and T. acetatoxydans were constructed using the 
Glontech Universal GenomeWalker kit (Glontech Labora- 
tories, GA). According to the manufacturer's instructions, 
the respective genomic DNA was digested with Hindi 
(New England Biolabs, Herts, U.K.), EcoRV, Pvull, or Stul 
(Glontech) and subsequently ligated to the GenomeWalker 
Adaptor resulting in uncloned, adaptor-ligated genomic 
DNA fragments referred as "libraries." In addition, Dra I 
libraries were generated of C. ultunense and T. acetatoxydans. 
All DNA libraries were stored at — 20°G. 
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Table 1. Primers used for genome walking, primer walking, and mRNA expression studies. 



Primer name 


Sequence' 


Organism/source 


Gene^ 


Application 


I ESP GSP1 up 

II ESP GSPl down 


AAATCCATCnCTCTAGGTACACCGn 
CTGAAGAACTAGGTTCAGTAGCAGTAC 


Clostridium ultunense strain Esp 
C. ultunense strain Esp 


fhs} 


Upstream walk 
Downstream 

walk 
Downstream 

walk 
Upstream walk 
Upstream walk 
Downstream 

walk 


III ESP GSP2 down 


GAGACGGTGGAATCGAATTAGCTAAGA 


C. ultunense strain Esp 


fhs} 


I 2.fhs ESP GSPl up 

II 2.fhs ESP GSP1-3 up 

III 2.fhs ESP GSPl down 


CTCCTCCATGGGAACCACCTGTGAGTA 
G GTGG ATCTTC GCTTTATATTTTC C GT 
CGGAAGGGGCCATGGCCCTTTTGTTAA 


C. ultunense strain Esp 
C. ultunense strain Esp 
C. ultunense strain Esp 


fhs2 
fhs2 
fhsl 



I Sp3 GSPl up 

II Sp3 GSP2 up 

IV Sp3 GSPl down 

V Sp3 GSP2 down 

III Sp3 GSP1-1 up 

VI Sp3 GSP1-1 down 

VII Sp3 AB downstream 
primer 



Syntrophaceticus schinkii strain Sp3 



TGTGTGCGTTCATGATGTCATTGATGT 

AGCGGCAGACCCTTTAATGnCATAGT S. schinkii strain Sp3 

nGATGGCCATTTTGGCTGTGGCAAAA S. schinkii strain Sp3 

AACAAGAAGGGTGAGCCGGTTACAACA S. schinkii strain Sp3 

ATAACACCAGAGGAAGGCAAGACCTAT S. schinkii strain Sp3 

TGCACTGGAACTGGCAGATGCTGnAT S. schinkii strain Sp3 



nCTGCAAGCATAACCTGCAGCACTTT 



S. schinkii strain Sp3 



fhs 
fhs 
fhs 

fhs 

fhs 
fhs 

fhs 



Upstream walk 
Upstream walk 
Downstream 

walk 
Downstream 

walk 
Upstream walk 
Downstream 

walk 
Primer walking 



I first primer upstream GSPl nTTCCATCATATGTATAACCAATTAT 



Tepidanaerobacter acetatoxydans strain 
Rel 



fhs} Upstream walk 



II nested primer upstr GSP2 

III first primer upstr GSPl -2 

IV nested primer up GSP2-2 

V DL2 upstream primer 

VI GSPl-3up 

VII nested GSP2-3up 

VIII DL1 2.3-kb up primer 
VIV GSP1-4up 

I I.fhs Rel GSPl down 

II I.fhs Rel nested GSP2 
down 

III I.fhs downstr pr (A1/A2) 

IV I.fhs GSP1-2 down 

V DL1 (Al) 2.fhs downstr 
prB' 

VI DL1 (Al) 2.fhs downstr 
pr3' 

VII I.fhs Rel GSPl up 
Vill I.fhs Rel GSP2 up 
VIV I.fhs Rel GSP2-2up 
X I.fhs DL4 upstream pr 
X first primer down GSPl 



GCAAGCCTATTTTTCAAATCnCCAT 

nACCGAGCCTTCTGAGTGCATCACCT 

TTTGCCTTCACCTGCAGGAGTTGGATT 

ATAG C AG GTGCTG C ATC AATG C C CTC 

GTCTGCATCACAGCCGATATGTGTTGA 

TGGCCCGAGGCACCACATTCATCTTTT 

nGTGCAAAGCATGCACTGCTCTAATT 

CAAAGATAACCGAnCGGCAATCCCAA 

CATATGATCGGGAGGGTAAGCCCGTTA 

CAAGGTGCCATGGCGGCTCTACTCAAA 

GCCCTGAAATAACACCCnnCATCCA 
CGCGGAGCCCTTAAAGCAATGGCAAAT 



GGGCTTACCCTCCCGATCATATGCTAC 

CnCGCCAGCCGTTCTTTCAAATCCAT 

CGACCTTTGCTTTATATTTGCCGTACA 

TGCCTGACAGCAAGATCATGAATAATT 

AAAGAGAGCTGGCACTGGTTCAAGAAG 



XI nested primer down GSP2 GCAGTGCTTTCCGAGGTTTGGGCAAAA 



XII DL3 downstream primer 

XIII DL3 downstream primer2 
1 .fhs locus rev 

I.fhs locus rev 
1 .locus Oxido fw 



AGCACTTACCGGTGCTATCATGACAAT 
ATGTTCATCGCCCTGTGGTGCAACATA 
CCAnACTATGCACAAACCTAAAGCCT 
TCAATGGGATCATAnCCGGAGTTACA 
TTATTTGC G ATTATC C GC AG G AAAG AC 



T. 


acetatoxydans strain 


Rel 


fhsl 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhs} 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhs} 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhs} 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhs} 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhs} 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhs} 


Primer walking 


T. 


acetatoxydans strain 


Rel 


fhs} 


Upstream walk 


T. 


acefafoxyc/ans strain 


Rel 


fhsl 


Downstream 










walk 


T. 


acetatoxydans strain 


Rel 


fhsl 


Downstream 










walk 


T. 


acetatoxydans strain 


Rel 


fhsl 


Primer walking 


T. 


acetatoxydans strain 


Rel 


fhsl 


Downstream 










walk 


T. 


acefafoxyc/ans strain 


Rel 


fhsl 


Primer walking 


T. 


acetatoxydans strain 


Rel 


fhsl 


Primer walking 


T. 


acetatoxydans strain 


Rel 


fhsl 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhsl 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhsl 


Upstream walk 


T. 


acetatoxydans strain 


Rel 


fhsl 


Primer walking 


T. 


acetatoxydans strain 


Rel 


fhs} 


Downstream 










walk 


T. 


acetatoxydans strain 


Rel 


fhs} 


Downstream 










walk 


T. 


acetatoxydans strain 


Rel 


fhs} 


Primer walking 


T. 


acetatoxydans strain 


Rel 


fhs} 


Primer walking 


T. 


acetatoxydans strain 


Rel 


fhs} 


Locus verification 


T. 


acetatoxydans strain 


Rel 


fhsl 


Locus verification 


T. 


acetatoxydans strain 


Rel 


fhs} 


Locus verification 
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Table 1. (Continued) 


Primer name 


Sequence' 


Organism/source 


Gene^ 


Application 


2. locus Methyltr fw 


ATGATTATTATTGGGGAAAAGATTAAC 


T. acetatoxydans strain Rel 


fhsl 


Locus verification 


Esp l.ffis RNA fw 


TAGACCAAACGGTGTACCTA 


C ultunense strain Esp 


fhsl 


mRNA expression 


Esp ^.fhs RNA rev 


AAGTAAAGCTACTGCTCCCT 


C. ultunense strain Esp 


fhs} 


mRNA expression 


Esp I.fhs RNA fw 


AAAGAAAAACGGAGTGCCTC 


C. ultunense strain Esp 


fhsl 


mRNA expression 


Esp Z.fhs RNA rev 


CTTTAACAAAAGGGCCATGG 


C. ultunense strain Esp 


fhsl 


mRNA expression 


Rel 1 .fhs RNA fw 


GGCTGCAACAGTATAATAGC 


T. acetatoxydans strain Rel 


fhs} 


mRNA expression 


Rel 1 .fhs RNA rev 


TCTTGACCCCGCCATTATAT 


T. acetatoxydans strain Rel 


fhs} 


mRNA expression 


RpI 7 fh^ RNA fw 


TGTTGTAGrATATGATrGGG 


T ^retatnyuH^n^ ^trsin RpI 


fhsl 


mRNA pynrp^^inn 

1 1 1 1 \ l M CAU 1 C J J lU 1 1 


Rel 2. fhs RNA rev 


TTACCGAGTTACATCCGTGT 


T. acetatoxydans strain Rel 


fhsl 


mRNA expression 


Sn^ fhs RNA fw 


rTrGAAAGArGGTTTCTTGA 


^ srhinlcii <;tr^in Sn3 

-J. II! jL[ Gil 1 uyJ-J 


fhs 


mRNA pynrp^^inn 

1 1 1 1 \ l M CAkJ 1 C J J lU 1 1 


Sp3 fhs RNA rev 


TTGATCGTGTTACGCATCCA 


S. schinkii strain Sp3 


fhs 


mRNA expression 


T. phaeum tinis study fw 


CGGACCCACCATGAACATTAAGGGTA 


Thermacetogenlum phaeum strain PB 


fhs 


Authenticity 










check 


T. phaeum this study rev 


AAGTCGATAACCCAGCCCAniCCAC 


T. phaeum strain PB 


fhs 


Authenticity 










check 


T. phaeum AB523739 fw 


ATCnGGCTCTAACTACCGGCCTCAA 


T. phaeum strain PB 


fhs 


Authenticity 










check 


T. phaeum AB523739 rev 


CTCGTTGCCATCATATCGGCCAGTAT 


T. phaeum strain PB 


fhs 


Authenticity 



check 



M13fw GTAAAACGACGGCCAGTG Invitrogen Sequencing 

Ml 3 rev GGAAACAGCTATGACCATG Invitrogen Sequencing 

Adaptor primer 1 (API) GTAATACGACTCACTATAGGGC GenomeWalker Kit Sequencing/walk 

Nested Adaptor primer 2 ACTATAGGGCACGCGTGGT GenomeWalker Kit Sequencing/walk 
AP2) 



Primers were designed using Geneious v4.5 and synthesized by Invitrogen (Carlsbad, CA). 
'Sequences are indicated as 5' to 3'. 

^Gene used to start genome walking or targeted in mRNA expression studies. 



Genome walking 

Genome walking was carried out using gene-specific 
primers (Table 1) targeting the partial fhs gene of the 
respective strain in combination with the AP1/AP2 primer 
set (Clontech) directed to the adaptor sequence. The sim- 
plified touchdown PGR was performed according to the 
manufacturer's protocol using the Advantage 2 PGR kit 
(Glontech) or PuReTaq Ready-to-go PGR beads. If neces- 
sary, a nested PGR was performed as recommended by 
Glontech. The fragments obtained ranged in size from 0.4 
to 3 kb and were purified using QIAquick PGR Purifica- 
tion Kit or Gel Extraction Kit (Qiagen) before sequencing. 
Fragments larger than 1.5 kb were cloned into the 
pGEMTeasy vector as described above. The resulting plas- 
mids were further analyzed by primer walking (Table 1) 
and/or sequenced by the M13 primer set. 

Verification of T. acetatoxydans fhs loci 

Accurate genome walking and assembly of the two sepa- 
rately located fhs loci were verified by amplification of 
5.0 kb (primer combination A), encompassing the 5' fhsl 
tail and its upstream region, and 2.1 kb (primer combina- 



tion B), encompassing the 3' fhsl tail and its downstream 
region. The fhsl locus was confirmed in the same way, gen- 
erating a 1.5-kb fragment (primer combination G) encoding 
the 5' fhsl tail and its upstream region and a 4.0-kb frag- 
ment (primer combination D) encoding the 3' fhsl and its 
downstream region (data not shown). Therefore, a touch- 
down PGR was performed including an initial denaturation 
at 95°G for 5 min, 11 cycles of denaturation at 94°G for 
60 sec, annealing at 63°G for 60 sec (decreased by 1°G per 
cycle to 53°G), and elongation at 72°G for 210 sec followed 
by 32 cycles consisting of 94°G for 60 sec, 53°G for 60 sec, 
and 72°G for 210 sec, and finalized by 7 min at 72°G. The 
reaction system consisted of 10 pmol of each primer, 20 ng 
genomic DNA, and Ready-to-go PGR beads. The primer 
combinations A-D were set up as follows: (A) 1. locus Oxido 
fw/nested primer upstream GSP2; (B) First primer down- 
stream GSPl/l.fhs locus rev; (G) 2.1ocus Methyltr fw/l.fhs 
Rel GSP2 up; and (D) 2.flis Rel nested GSP2 down/l.fhs 
locus rev (primer sequences are listed in Table 1). 

DNA sequencing 

Sequencing was performed by Uppsala Genome Genter, Swe- 
den using costumer provided primers as listed in Table 1. 
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Prediction and annotation of CDS 

Sequence assembly, editing, and prediction of coding 
sequences (CDSs) and open reading frames (ORFs) were 
achieved by Geneious v5.4 (Drummond et al. 2011). All 
CDSs were double checked and corrected manually by 
comparing the predicted protein sequences with the pub- 
licly available database of GenBank using BLAST (Altschul 
et al. 1990). BROM, available online at http://www.soft- 
berry.com, was used as the promoter-prediction program. 
MEME (MEME@ncbi.net) was used for analyzing palin- 
dromes and repetitive sequences. Codon usage patterns 
were analyzed on http://www.geneinfinity.org. 

Tree construction 

Multiple sequence alignments were performed using MUS- 
CEL 3.8.31 (Edgar 2004). A maximum-likelihood tree was 
constructed using PhyML 3.0 (Guindon and Gascuel 2003) 
choosing the WAD substitution matrix and MEGA5.05 
(Tamura et al. 2011). MUSCEL 3.8.31 and PhyML 3.0 are 
both available on the Mobyle platform of Institut Pasteur. 

Plasmid construction 

Plasmid pBM06 was constructed by ligating a PGR frag- 
ment encompassing the fhsl gene (encoding FTHFSl) of 
T. acetatoxydans as an Ndel/BamHl fragment with the 
expression vector pETlSb (Novagen). Plasmid pBROl was 
constructed likewise carrying the fhsl gene (encoding 
FTHFS2) of T. acetatoxydans. A partial digest was per- 
formed in the case of Ndel due to an internal restriction 
site. Sequencing revealed a change in genotype (T->G) at 
bp 425 for fhsl compared with the sequence obtained by 
genome walking. At this stage of the study, it was not 
clear which nucleotide reflected reality. However, multiple 
sequence alignment using 80 randomly picked publicly 
available formyltetrahydrofolate synthetases (FTHFS) 
revealed the affected residue (Leui42^Ser) to be not con- 
served and located outside any catalytic site. 

Purification procedures 

FTHFSl and FTHFS2 of T. acetatoxydans were purified as 
N-terminal His-tagged fusion proteins from cell extract of 
E. coli strain BL21 (DE3) (Novagen) harboring plasmid 
pBB06 or pRBOl, respectively. GeUs were grown under vig- 
orous shaking in LB medium at 37°G and overexpression 
of fhsl or fhsl was induced by adding 0.5 mmol/L IPTG at 
OD 0.3 for 3 h. Cells were harvested and resuspended in 
100 mmol/L Tris/HCl, pH 7.5, containing 50 /tg/mL 
DNase (Sigma-Aldrich, Seelze, Germany). Cell extract was 
recovered by centrifugation after disruption of the cells 



using a French press. Both recombinant proteins were puri- 
fied by IMAC immobilized metal-affinity chromatography) 
using TALON metal-affinity resin from Clontech. The 
washing buffer contained 0.3 mol/L NaCl in 0.05 mol/L 
Tris/HCl, pH 7.5. Proteins were eluted with 0.15 mol/L 
imidazole added to the washing buffer. Elution fractions 
enriched with the respective protein were pooled and sub- 
sequently washed with 0.15 mol/L NaCl in 0.1 mol/L Bis- 
Tris/HCl, pH 6.9 and concentrated at the same time using 
Vivaspin 20 concentrator columns (Sartorius Stedim Bio- 
tech, cut-off size 10 kDa). All solutions were supplemented 
with EDTA-free protease inhibitor cocktail (Roche Diag- 
nostics, Mannheim, Germany) as recommended. To keep 
the proteins soluble at higher concentrations, 5 mmol/L 
DTT (dithiothreitol) was added to FTHFS 2 and 2 mmol/L 
DTT to FTHFS 1. Proteins were stored at -20°C. 

FTHFS assay 

Enzymatic formylation of cf,Z-tetrahydrofolate was mea- 
sured at 60°C based on the assay method of Rabinowitz 
and Pricer (1962). The reaction mixture contained 
100 mmol/L sodium formate, 10 mmol/L NH4CI, and 
2 mmol/L (i,Z-tetrahydrofolate (Sigma-Aldrich; stabilized 
with 1 mol/L /?-mercaptoethanol at pH 7.5) in 100 mmol/L 
HEPES, pH 8.3. In order to calculate the specific 
activities, protein was added in a range from 0.05 to 
50 /.(g/mL. After 5 min preincubation at 60°C, the 
reaction was started by adding ATP/MgCl2 to a final 
concentration of 5 mmol/L and 10 mmol/L, respectively. 
Samples were taken after 1, 2, 3, 4, 5, and 10 min, and the 
reactions were stopped by adding 0.36 N HCl in a ratio of 
1:2. HCl converts the product 10-formyltetrahydrofolate to 
5,10-methenyltetrahydrofolate (fissonm = 24,900 L* moP'* 
cm^^), which was measured spectrophotometrically at 
350 nm (Infinite M200 plate reader, TECAN) after 10 min. 
Specific activity was calculated from product absorbency, 
linearly proportional to time and number of micrograms. 
AH activity measurements were run in triplicate. 

Other analytical procedures 

Protein determination was carried out using the protein 
assay kit from Bio-Rad. SDS-PAGE (SDS-polyacrylamid 
gel electrophoresis) was performed using precast 4-15% 
mini-PROTEAN SDS-PAGE gels and the respective 
equipment purchased from Bio-Rad. Mass spectrometry 
was carried out by the Department of Medical Biochemis- 
try and Microbiology (Biomedical Center, Uppsala, Swe- 
den) using MALDI/TOF (Ultraflex MALDI TOE, Bruker 
Daltonics). DNA was visualized by agarose-ethidium bro- 
mide electrophoresis using 1.0% or 1.5% agarose gels and 
1-kb DNA ladder from Fermentas. 
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Nucleotide sequence accession number 

The nucleotide sequence data reported in this study were 
deposited at GenBank with accession numbers as follows: 
JQ979072 (r. acetatoxydans fhsl locus), JQ979073 
(T. acetatoxydans fhsl locus), JQ979075 (C. ultunense fhsl 
locus); JQ979076 (C. ultunense fhsl locus), JQ979074 (S. 
schinkii fhs locus), and JQ979077 (T. phaeum partial /Tis). 

Design of degenerated FTHFS primers 

FuH-length FTHFS sequences of acetogens and nonaceto- 
gens including Alkaliphilus metalliredigens (CP000724), 
Acetobacterium woodii (AF295701), Blautia hydrogeno- 
trophica (ACBZ01000230), Clostridium difficile (FN54816), 
Clostridium formicaceticum (AF295702), Carhoxydothermus 
hydrogenoformans (CP000141), Clostridium magnum 
(AF295703), Clostridium acetobutylicum (AE001437), Clos- 
tridium beijerinckii (NC009617), Clostridium cellulolyticum 
(CP001348), Clostridium cylindrosporum (L12465), Clos- 
tridium phytofermentas (CP000885), Desulfitobacterium 
hafniense (CP001336), Eubacterium eligens (CP001104), 
M. thermoacetica (CP000232), Symbiobacterium thermo- 
philum (AP006840), Syntrophomonas wolfei (CP000448), 
T. kivui (AF295704), Anaerostipes caccae (DS499743), 



Bacillus cereus (EEL11738), Clostridium acidiurici 
(M21507), Clostridium botulinum (CP000727), Clostrid- 
ium kluyveri (CP000673), Clostridium novyi (CP000382), 
Clostridium perfringens (CP000246), Clostridium 
thermocellum (CP000568), Clostridium tetani (AE015927), 
Eggerthella lenta (CP001726), Eubacterium rectale 
(CP001107), Selenomonas sputigena (CP002637), Slackia 
heliotrinireducens (CP00168), and Thermoanaerobacter 
spec. (CP000923), as well as partial FTHFS sequences of 
homoacetogens and sulfate reducers (Leaphart and Lovell 
2001; Leaphart et al. 2003; Matsui et al. 2008; Wester- 
holm et al. 2011a), were ahgned in Geneious v5.4 with 
ClustalW. A new primer pair designated 3-SAOfhs-iw 
(CCNACNCCNGCHGGNGARGGNAA) and 3-SAO-rev 
(ATRTTNGCRAADGGNCCNCCRTG) was designed tar- 
geting identified conserved stretches (Fig. 2). 

PGR amplification of fhs genes 

The primer pair FTHFSf/FTHFSr designed by Leaphart and 
Lovell (2001) was used in a touchdown PGR as described, 
but with the annealing temperature decreased down to 
53°G. PGR reactions were performed using Ready-to-go 
PGR beads, 20 ng genomic DNA, and 20 pmole of each 
primer. Alternatively, PGR conditions were adjusted as 
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Figure 2. Nucleic acid sequence alignment of the degenerated primer-binding sites exemplified with 18 full-length fhs genes, (a) bp and amino 
acid numbers are based on the A/^°-formyltetrahydrofolate synthetase of Moorella thermoacetica. (b) FTHFS primer pair designed by Leaphart and 
Lovell (2001). For further details, see Results section. 
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described by Lovell and Leaphart (2005) using 49°C and 
47°C, respectively, as annealing temperature. The optimized 
touchdown PGR used for the 3-SAO/?i5-fw/rev primer pair 
was performed after an initial melting step at 94°C for 
5 min comprising 11 cycles consisting of 60 sec at 94°C, 
60 sec at 63°C (decreased by 1°C per cycle to 53°C), and 
60 sec at 68°C, followed by 30 cycles at 94°C for 60 sec, 
53°C for 60 sec, and 68°C for 60 sec, finalized by 20 min at 
68°C. PGR reactions consisted of Ready-to-go PGR beads, 
20 ng genomic DNA, and 50 pmole of each primer. The 
PGR conditions used for the specific T. phaeum primer 
(listed in Table 1) targeting either the partial /?is gene iden- 
tified in this study or the partial /Tis gene (AB523739) identi- 
fied by Hori et al. (2011) were as follows: 95°G 5 min; 30 
cycles of 94°G for 1 min, 61°G for 1 min, 72°G for 1 min, 
finaUzed by 5 min at 72°G. Here, 10 pmole of each primer 
were used. 

Results 

Tepidanaerobacter acetatoxydans and 
Clostridium ultunense harbor two fhs alleles 

The FTHFSf/FTHFSr primer pair designed by Leaphart 
and LoveU (2001) has been used successfully for targeting 
partial fhs species in numerous environmental samples 
and in pure cultures (see Experimental Procedures sec- 
tion). However, in the case of SAOB, the recommended 
touchdown PGR generated an amplicon for C. ultunense, 
but not for T. acetatoxydans, S. schinkii, and T. phaeum. 
After reducing the annealing temperature down to 47°C 
as suggested by LoveU and Leaphart (2005), an impure 
amplification was also achieved for T. acetatoxydans (data 
not shown). Hence, based on an alignment of the 
deduced polypeptide sequences of a total of 27 full-length 
fhs genes (representatives are shown in Fig. 2) and an 
additional 40 partial fhs genes (see Experimental Proce- 
dures), a new primer pair was designed (Fig. 2): The 
forward primer 3-SAO/?is-fw shows a degeneracy at seven 
positions of a total of 23 base pairs and is directed to the 
WalkerA motif, a common feature of all members of the 
super family of P-type NTPases (Walker et al. 1982). For 
the reverse primer 3-SAO/?!S-rev, a stretch of seven amino 
acid residues resulting in a sevenfold degenerated 23mer 
primer was identified as highly conserved within the 
FTHFS, but wdth fewer similarities to the P-type NTPases 
super family. The primer pair amplifies approximately 
635 base pairs encoding a polypeptide of 211 amino acid 
residues encompassing the signature I motif and 
the cesium/potassium-binding site (Radfar et al. 2000), 
allowing appropriate allocation of the sequences obtained. 

In the case of SAOB, it was possible to generate ampli- 
cons from purified genomic DNA of all three strains con- 



sidered in this study and of T. phaeum at the expected 
size. However, sequencing of the PGR products revealed 
impure amplification, and the PGR products were there- 
fore cloned and further analyzed. To our surprise, two 
different partial fhs genes of T. acetatoxydans and of C. 
uhunense were found. For S. schinkii and T. phaeum, only 
one partial fhs gene each was identified. 

The covered fhs genes are highly 
phenotypically distinguished 

In order to evaluate the FTHFS from SAOB phylogeneti- 
cally, we completed the adjacent upstream and down- 
stream sequences of the covered partial fhs genes 
by genome walking using adaptor-ligated libraries of 
C. uhunense, T. acetatoxydans, and S. schinkii in combina- 
tion with gene-specific primers. Maximum-likelihood tree 
construction based on the deduced amino acid sequences 
revealed them to be highly phenotypically distinguished 
from each other and with partly poor identities to known 
acetogens. As indicated in the dendrogram (Fig. 3), the 
FTHFS 1 of T. acetatoxydans formed a tight group with the 
recently sequenced acetogen Thermosediminihacter oceani 
isolated from sea floor sediment (Pitluck et al. 2010), 
showing 85.4% sequence identity (Table 2). Distinctly sep- 
arated from FTHFS 1, the FTHFS2 of T. acetatoxydans clus- 
tered together with Glostridia species described as 
autotrophic acetogens. Thus, Clostridium carboxidivorans 
as the nearest relative showed 78.7% sequence identity, fol- 
lowed by Clostridium ljungdahlii and C. magnum with 
78.7% and 77.5% identity, respectively. The FHTFSl of 
C. ultunense was found to be clustered together with 
FTHFS amplified from purinolytic Clostridia, lacking 
the W-L pathway. The level of sequence identity between 
C. ultunense and C. cylindrosporum and Clostridium acidu- 
rici was 71.5% and 70.2%, respectively. However, the phy- 
logenetic classification was rather low, with only 16.3% of 
the bootstrap replicates. The FTHFS2 of C. ultunense was 
substantially outgrouped and showed no similarity to acet- 
ogens and purine fermenters or to sulfate reducers. A pro- 
tein-protein blastp search performed against the 
nonredundant protein database revealed Desulfitobacterium 
metallireducens as the closest relative, with 66% identity. 
The FTHFS of both S. schinkii and T. phaeum were found 
to be tightly grouped within a cluster representing mainly 
known sulfate reducers. Although the sequence identity 
between T. phaeum and S. schinkii was significantly high 
(81.4%), the bootstrap value was below 50% because of the 
truncated polypeptide of T. phaeum used in the alignment. 
The autotrophic acetogen Acetonema longum and the sul- 
fate reducer Desulfosporosinus meridiei had the closest rela- 
tive identities, ranging from 80.9% to 83.1% compared 
with S. schinkii and T. phaeum, respectively (Table 2). 
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Figure 3. Phylogenetic placement of deduced amino acid sequences of the full-length fhs genes of Clostridium ultunense, Syntrophaceticus 
5chinl<ii, and Tepidanaerobacter acetatoxydans, and the partially recovered fhs gene of Ttiermacetogenium ptiaeum. Positions are highlighted in 
red. The tree is based on maximum-likelihood alignment. Bootstrap values supported by 1000 replicates are indicated at the nodes if larger than 
50%. GenBank accession numbers are given in brackets. 



Table 2. Pairwise distance among the different FTHFS obtained and between them and their closest relatives as shown by the phylogenetic tree. 
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Distances are given as identity (%) and are represented by slnades of gray depending on identity level (dark gray = Inigin identity, light gray = low 
identity). Pairwise distances were performed in Geneious vB.4 using MUSCEL. 



Two large insertions within FTHFS connect 
S. schinkii and T. phaeum with sulfate 
reducers 

Multiple sequence alignment revealed two large insertions, 
which seem to be common to members of the S. schinkii/ T. 



phaeum cluster (Fig. 4, not all members are shown). Thir- 
teen additional amino acid residues are inserted between 
residues 152 and 153 referring to the numbering of the 
FTHFS of M. thermoacetica. Another 8-amino acid inser- 
tion is located between residues 343 and 344, followed by a 
2-amino acid insertion within the N-terminus after residue 
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Figure 4. Multiple sequence alignment of the full-length FTHFS of Tepidanaerobacter acetatoxydans, Syntrophaceticus schinkii, and Ciostridium 
ultunense, and the partial FTHFS of Thermacetogenium phaeum in comparison with their closest relatives. Only the amino acid residues 100-437 
are shown based on the amino acid number of the FTHFS sequence of Moorelia thermoacetica. Sequence alignment was performed in Geneious 
vB.4 using MUSCLE. 



15 and two single amino acid insertions after residues 478 
and 550, respectively (the latter not shown). Controver- 
siaDy, the recently published partial sequence of T. phaeum 
(AB523739; Hori et al. 2011) fell outside the S. schinkii/T. 
phaeum cluster and exhibited only poor identity to the par- 
tial sequence obtained in this study. Even though the poly- 
peptide showed the same 8-amino acid insertion between 
residues 343 and 344, the first large insertion between 
amino acid residues 152 and 153 was missing (data not 
shown). However, subsequent 16sRNA analysis approved 
the genotype as T. phaeum strain PB and confirmed the 
DNA purity used for PGR. The question therefore arose as 
to whether T. phaeum might harbor two fhs alleles, as 
found in T. acetatoxydans and C. ultunense, which might 
not be targeted by one and the same degenerated primer 
pair. Specific primers intended to target the different partial 
sequences of T. phaeum were designed, but only the partial 



fhs gene obtained by the 3-SAO/7is primer pair was recov- 
ered (data not shown). 

Tepidanaerobacter acetatoxydans FTHFS1 
and FTHFS2 were functionally active, but 
only FTHFS1 was expressed 

The interesting finding of phylogeneticaUy distinguished 
fhs genes harbored by one strain raised the question of 
which fhs gene is expressed in those strains harboring two 
alleles. To investigate this, mRNA samples from a cocul- 
ture containing C. ultunense, T. acetatoxydans, and 
S. schinkii growing syntrophically with a hydrogenotrophic 
methanogen on acetate were compared against mRNA 
samples of the respective pure culture growing hetero- 
trophicaUy on lactate, glucose, and betaine, respectively. 
No differences in mRNA expression pattern were 
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Figure 5. Fhs expression pattern in response to Ineterotroplnic growtin 
conditions producing acetate (A) and to syntroplnic growtin conditions 
consuming acetate (B). (1) Tepidanaerobacter acetatoxydans fhs]; 
(2) T. acetatoxydans fhsl; (3) Clostridium ultunense fhs] ; (4) Ciostridium 
ultunense fhsl; and (B) Syntrophaceticus schinlaii. 




Figure 6. SDS-PAGE of His-tagged FTHFS1 (A) and FTHFS2 (B) of 
Tepidanaerobacter acetatoxydans. Purification was carried out by 
TALON metal-affinity chromatograplny as described under 
Experimental Procedures. M, molecular weight markers; C, cell 
extract; W, washing step; lane 1-14 (A) and lane 1-12 (B) = elution 
steps. Respective His-tagged FTHFS is framed and marked by an 
arrow. 

observed (Fig. 5). The fhsl gene of both T. acetatoxydans 
and C. ultunense was found not to be expressed in either 
type of culture. However, the respective fhsl gene was 
clearly expressed. The fhs gene of S. schinkii was expressed 
in both coculture and pure culture. 

Moreover, both FTHFS of T. acetatoxydans were puri- 
fied to homogeneity and shown to be active as His-tag 
fusion proteins. The SDS-PAGE migration behavior was 
according to the deduced molecular weight of 60 kDa for 
a monomer (Fig. 6A and B). In addition, protein identity 
was confirmed by mass spectrometric peptide mapping. 



The specific activity of FTHFS 1 was with 23 U/mg pro- 
tein, approximately three times lower than the specific 
activity of FTHFS2, with 60 U/mg protein. 

The fAi5l gene of T. acetatoxydans is 
surrounded by genes belonging to both the 
methyl and carbonyl branch of the W-L 
pathway forming a putative operon 

To obtain more information about the gene structure sur- 
rounding the respective fhs gene, we continued genome 
walking upstream and downstream. In the case of 
T. acetatoxydans fhsl, the flanking genome segments were 
analyzed up to 8.2 kb, encompassing eight further ORFs, 
while in the case of T. acetatoxydans fhsl, the flanking 
genome segments were analyzed up to 6.3 kb, encoding 
six more ORFs. The location, size, and deduced poly- 
peptide length of these are summarized in Figure 7. 
Amplification of large fragments from each T. acetatoxy- 
dans fhs locus confirmed the accurate assembly and the 
existence of two clearly distinguished loci (see Experimen- 
tal Procedures). Within the first fhs locus, an ORF desig- 
nated acsA was identified upstream to fhsl encoding a 
putative carbon monoxide dehydrogenase (CODH), the 
key enzyme of the carbonyl branch of the W-L pathway 
(Fig. 1). Adjacent downstream to acsA, a gene for a puta- 
tive cobyrinic acid a,c-diamide synthase was found. The 
function of this enzyme has been described as essential 
for maturation of the Ni center of CODH (Kerby et al. 
1997), and was named cooC. The deduced polypeptide of 
the third ORF [cooF) located upstream of acsA showed 
high similarity to iron-sulfur cluster-binding proteins. An 
ORF (ORFl) encoding a putative oxidoreductase was 
found to be divergently orientated to the fhsl cluster. 
Analysis of the deduced amino acid sequence of the three 
ORFs adjacent downstream to flisl identified genes for 
two more putative enzymes of the W-L pathway: a 
methenyltetrahydrofolate cyclohydrolase designated fchA 
and a partial methylenetetrahydrofolate dehydrogenase 
designated folD (Fig. 1). The third ORF (ORF3), when 
blasted against the nonredundant protein database, 
showed low similarities to proteins with unknown func- 
tion (as did ORF2) and therefore could not be allocated 
functionally. Two putative promoter sequences PI and P2 
(Fig. 7) were found to be arranged as face-to-face pro- 
moters (Beck and Warren 1988) within the intergenic 
region flanked by cooF and the divergently transcribed 
ORFl. As the intergenic regions located dovmstream of 
cooF are rather short and exhibited no further predictable 
promoter elements, the fhsl locus was considered a puta- 
tive operon. It was possible to predict a putative third pro- 
moter sequence (P3) at a distance of 57 bp from the start 
codon of ORF2. 
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Figure 7. Gene organization of the fhs loci of Tepidanaerobacter acetatoxydans identified by genome walking. Genome walk primers as listed in 
Table 2 are depicted in Latin numerals; direction of arrows reflects upstream or downstream walking. Gene sizes and deduced polypeptide 
lengths are given in bp and aa, respectively. P1/P2/P3, putative promoter sites; IR, intergenic regions given in bp; SD, Shine-Dalgarno sequence; 
aa, amino acid residues; bp, base pairs. Enzyme restriction sites are not shown for the sake of clarity. ORF, open reading frame; cooC, Ni-insertion 
protein required for CODH maturation; metF, methylenetetrahydrofolate reductase; acsf, methyltransferase; cooS, CODH subunit; fchA, 
methenyltetrahydrofolate cyclohydrolase; folD, methylenetetrahydrofolate dehydrogenase; fhs, formyltetrahydrofolate synthetase; and cooF, Fe-S 
protein. 



The fhs2 gene of T. acetatoxydans is 
embedded in genes belonging to the methyl 
branch of the W-L pathway 

The five ORFs clustered with the second fhs were found 
to be organized into two putative operons (Fig. 7), sepa- 
rated by a single gene adjacent to its own putative pro- 
moter (P2). Interestingly, the fhs2 locus also possesses 
genes encoding a putative methenyltetrahydrofolate cyclo- 
hydrolase (fchA) and a methylenetetrahydrofolate dehy- 
drogenase (folD), but those were not organized within 
one operon as seen for the fhsl locus. Moreover, it 
proved possible to allocate two more ORFs to the W-L 
pathway: a putative methylenetetrahydrofolate reductase 
(metF) and a methyltetrahydrofolate methyltransferase 
(acsE) (Fig. 1). FolD and metF were found to be orga- 
nized in one operon adjacent to a putative promoter 



sequence (P3) separated by an ORF (ORF2) that could 
not be allocated functionally. The second operon consist- 
ing of the putative promoter sequence PI, the methyl- 
transferase gene acsE, and the second fhs gene was flanked 
upstream by a large AT-rich intergenic region of 489 bp, 
followed by a partial ORF (ORFl) that could not be 
related to any protein function. 

All ORFs within the fhsl locus were found to have an 
AUG start codon, whereas fchA, folD and ORF2 belong- 
ing to the fhsl locus exhibited the alternative start codons 
GUG, UUG, and UUG, respectively. The corresponding 
Shine-Dalgarno sequences were identified within a dis- 
tance of 7 bp to 11 bp from the respective initiation co- 
dons. No significant variation in GC content or average 
codon usage was observed within or between the different 
fhs clusters. However, all deduced proteins from the first 
fhs locus belonging to the W-L pathway were most simi- 
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lar to proteins from T. oceani, ranging from 60% to 85% 
identity, whereas the proteins from the second fhs locus 
were rather identical to proteins from C. carboxidivorans 
or C. ljungdahlii except for methylenetetrahydrofolate 
reductase and methyltransferase. The identities of the 
deduced polypeptides of fhs2, fchA, ORF2, and folD 
ranged from 63% to 79% compared with C. carboxidivo- 
rans and 62% to 79% compared with C. ljungdahlii. The 
methylenetetrahydrofolate reductase exhibited 74% iden- 
tity to the protein of Clostridium ragsdalii or C. difficile, 
but still 69% identity to the protein of C. carboxidivorans. 
On the contrary, the methyltransferase was most similar 
(81% identity) to a protein of Thermincola potens. 

Comparing the two FTHFS of T. acetatoxydans, 74% of 
the amino acid residues were identical and 86% were sim- 
ilar, whereas only 54% sequence identity and 75% simi- 
larity, respectively, were observed between the 
methenyltetrahydrofolate cyclohydrolases of these two 
loci. The putative oxidoreductase clustering together with 
the //isl operon showed no similarities to any of the 



oxidoreductases of the bacteria mentioned above. Instead, 
it exhibited 60% identity to a protein of the Geobacillus 
species WCH70. 

The fhs genes identified in 5. schinkii and 
C. ultunense were located separately from 
other genes of the W-L pathway 

No further genes belonging to the W-L pathway could 
be identified within the adjacencies to the fhs genes of 
S. schinkii and C. ultunense (Fig. 8). A putative metal- 
dependent phosphohydrolase and a partial dihydroptero- 
ate synthase encoded by genes designated as fol_PH and 
fol_P, respectively, were identified downstream of the 
S. schinkii fhs gene. The short 37-bp intergenic region 
and a putative promoter sequence (P3) identified 
upstream of fol_PH suggested an operon consisting of at 
least fol_PH and fol_P. A putative nucleoside di- 
phosphate kinase gene designated fol_NDK was found 
located upstream of fhs and separated by a large inter- 
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Figure 8. Gene organization of tlie fhs loci of Syntrophaceticus schinkii and Clostridium ultunense identified by genome walking. Genome walk 
primers as listed in Table 2 are depicted in Latin numerals; direction of arrows reflects upstream or downstream walking. Gene sizes and deduced 
polypeptide lengths are given in bp and aa, respectively. P1/P2/P3, putative promoter sites; IR, intergenic regions given in bp; SD, Shine-Dalgarno 
sequence; aa, amino acid residues; bp, base pairs. Enzyme restriction sites are not sinown for the sake of clarity. ORF, open reading frame; fol_PH, 
phosphohydrolase; fol_P, dihydropteroate synthase; fol_NDK, nucleoside diphosphate kinase; oppF, oligopeptide/dipeptide ABC transport system; 
fhs, formyltetrahydrofolate synthetase; and panK, pantothenate kinase Typlll. 
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genie region of 317 bp, including a putative promoter 
sequence (P2) for flis. The diliydropteroate syntlietase 
catalyzes the condensation of p-aminobenzoic acid and 
6-hydroxymethyl-7,8-dihydropterin pyrophosphate in the 
de novo biosynthesis of folate (de Cre'cy-Lagard et al. 
2007). As this pathway requires both phosphoryl group 
transfer reactions and phosphoryl group hydrolysis, the 
putative fol_PH and fol_NDK genes might also belong to 
the folate biosynthesis pathway. Fol_NDK and fol_P were 
found to have GTG instead of ATG as their start codon. 
Putative Shine-Dalgarno sequences were found within a 
distance of 6-10 bp. No differences in codon usage or 
GC content were observed. The deduced polypeptides of 
fol_NDK, fol_PH, and folP of S. schinkii were most iden- 
tical to proteins of Pelotomaculum thermopropionicum 
(74%), T. potens (54%), and Desulfotomaculum kuznetso- 
vii (60%), respectively. The respective similarities were 
85%, 73%, and 75%. 

The fhsl gene of C. ultunense clustered together with a 
gene encoding the ATPase subunit of a putative oligo- 
peptide/dipeptide ABC transport system (oppF) separated 
by a large intergenic region of 308 bp. Furthermore, a 
partial ORF was found located downstream of fhsl encod- 
ing a partial polypeptide consisting of 35 amino acid resi- 
dues. An operon structure seems to be likely, due to the 
fact that putative promoter elements were only found 
upstream of the flis gene and the short intergenic region 
between fhs and oppF. The GC content and the codon 
usage pattern were equal within this cluster, but GTG and 
TTG were found as start codons for fhs and the adjacent 
ORF, respectively. The deduced polypeptide of oppF 
exhibited 89% similarity and 64% identity to an oligo- 
peptide ABC transporter of Alkaliphilus oremlandii. The 
partial polypeptide deduced from the ORF showed 
features of the helix-turn-helix superfamily, but could 
not be farther allocated. Analysis of the second fhs locus 
of C. ultunense encountered difficulties because no pure 
upstream fragment larger than 132 bp could be amplified. 
The fragment obtained seemed to represent an intergenic 
region without any regulatory or promoter sequences. 
The downstream-located ORF designated panK encodes a 
partial putative pantothenate kinase Type III, which cata- 
lyzes the phosphorylation of pantothenate as the first step 
of its conversion to coenzyme A. The short intergenic 
region between fhsl and panK did not show any pro- 
moter elements, which might point to an operon struc- 
ture. The blastp search algorithm identified Bacillus 
coagulans as the closest relative to panK, sharing 61% 
sequence identity and 76% similarity. Compared with the 
fhsl locus, a remarkable discrepancy in GC content was 
observed. Within the fhsl locus, the GC content ranged 
between 28.6% and 35.5%, whereas within the fhsl locus, 
it varied from 49.1% to 51.2%. Moreover, distinct 
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changes in codon usage were detected on comparing the 
two fhs loci of C. ultunense. 

Discussion 

To our knowledge, only five complete genomes of aceto- 
genic bacteria, covering T. oceani (Pitluck et al. 2010), 
C. Ijungdahlii (Kopke et al. 2010), M. thermoacetica 
(Pierce et al. 2008), Eubacterium limosum (Roh et al. 
2011), and A. woodii (Poehlein et al. 2012) have been 
sequenced and all genes encoding the enzymes of the W-L 
pathway have been annotated (The last one was published 
quite recently and therefore not analyzed comparatively.). 
In addition, the genome of the strictly CO-utUizing bacte- 
rium C. hydrogenoformans (Wu et al. 2005), the metal- 
reducing bacterium A. metalliredigens (Ye et al. 2004), and 
the W-L pathway gene cluster of the CO-utUizing C. carb- 
oxidivorans (Bruant et al. 2010) have been reported. The 
putative operon identified here within the T. acetatoxydans 
fhsl locus was found to be identical in gene organization 
to the W-L pathway gene cluster of T. oceani (Fig. 9) and 
most identical concerning gene identities. As the 16sRNA 
genes showed 89% sequence identity, a common ancestor 
is rather more likely than horizontal gene transfer. More- 
over, the locus structure seems to be highly conserved 
when compared with the fhs gene clusters of C. Ijungdahlii, 
A. metalliredigens, and C. carhoxidivorans (Fig. 9), leading 
us to speculate that the T. acetatoxydans flisl locus might 
continue in the same way encoding all proteins of the W-L 
pathway. Interestingly, two additional genes designated 
cooF and hyp were found to be part of the operon in 
T. acetatoxydans fhsl and T. oceani, but not in the other 
three bacteria species mentioned. CooF homologs, ana- 
lyzed by the blastp search algorithm, were mainly found in 
sulfate reducers and CO oxidizers, but seemed not to be 
common in acetogens. Moreover, the organization of cooF 
showed similarities to the Coo regulon of Rhodospirillum 
rubrum (Singer et al. 2006) and to C. hydrogenoformans 
(Wu et al. 2005), where cooF, cooS, and cooC are arranged 
in the same way as in T. acetatoxydans and T. oceani 
(Fig. 9). These bacteria can grow using carbon monoxide 
as the sole carbon and energy source, producing H2 and 
CO2 by using homolog enzymes of the carbonyl branch of 
the W-L pathway. Here, CooF is proposed to be the elec- 
tron transfer mediator between CODH and the respective 
CO-induced hydrogenases, coupling the oxidation of CO 
to CO2 with H2 evolution. In _R. rubrum, a gene cluster 
460 bp upstream of the Coo regulon encodes a scc-subunit 
[NiFe] hydrogenase that catalyzes H2 evolution, likewise 
in C. hydrogenoformans. In T. acetatoxydans, the diver- 
gently orientated putative oxidoreductase located 
upstream of the fhsl cluster might have functional similar- 
ities to the [NiFe] hydrogenase complex mentioned above. 
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This clearly distinguished T. acetatoxydans from T. oceani, 
where the upstream structure adjacent to cooF does not 
show a divergently orientated ORF. In consideration of 
the finding that no homolog proteins could be identified 
in T. oceani or in any other acetogenic bacteria, the 
upstream region might have been modified by horizontal 
gene transfer. This assumption is supported by the finding 
of a DNA fragment identified farther upstream of the 
T. acetatoxydans fhsl locus encoding a partial sequence of 
a putative transposase belonging to the IS4 family (data 
not shown). Interestingly, the hypothetical protein {hyp or 
ORF3) unique within the fits cluster of T. oceani and 
T. acetatoxydans shared, with few exceptions only, identi- 
ties with hypothetical proteins of archaea including 
members of the families Methanosaetaceae and Methano- 
sarcinaceae, some of which are known to be partly aceti- 
clastic methanogens. The fact that no function was 
aUocatable might point to an unknown regulatory protein. 
The abundance of repetitive and palindromic sequences 
(data not shown) identified within the intergenic region of 
the fhs operon and the divergently orientated putative oxi- 
doreductase indicates a rather strongly regulated operon. 
The putative face-to-face promoters PI and P2 encom- 
passing several of those putative regulatory sites suggest 
simultaneous up- and downregulation of the fhs operon 
and the putative oxidoreductase, indicating a function of 
the latter connected to acetogenesis and/or acetate oxida- 
tion. However, the possibility cannot be excluded that 
other sequences may function as weak promoter sites 
resulting in back-to-back promoters, which might allow 
alternative regulatory mechanisms. In conclusion, an 



operon encoding enzymes of the entire W-L pathway 
might be one of the essential adaptations needed for the 
energy-limited syntrophic lifestyle, as presumed for the 
methylmalonyl-CoA pathway operon of the syntrophic 
propionate oxidizer P. thermopropionicum (Kato and 
Watanabe 2010), due to increasing the transcriptional 
efficiency. 

The second fhs locus seems to be unique to T. acetat- 
oxydans: None of the bacteria mentioned above (or any 
other acetogen characterized) have been found to harbor 
a second fhs gene. The cluster organization mainly shows 
similarities to the cluster found in E. Umosum regarding 
gene composition and operon structure (Fig. 9). No 
further genes belonging to the W-L pathway were found 
adjacent to the methylenetetrahydrofolate reductase gene 
[metF) in E. Umosum and the same situation may also 
occur in T. acetatoxydans. As none of the ORFs showed 
any similarities to genes of T. oceani or to any more clo- 
sely related bacteria, the additional gene set might have 
been achieved once by horizontal gene transfer. Although 
both FTHFSs were shown to be active and the second 
locus showed a functional gene organization, the mRNA 
expression studies suggest that only the T. acetatoxydans 
flisl operon is functional under heterotrophic and syn- 
trophic growing conditions. However, depending on the 
habitat, there may be other growing conditions where the 
fhsl operon and subsequently the W-L pathway genes 
become repressed. As indicated by the findings reported 
above, the T. acetatoxydans fhsl cluster seems to be regu- 
lated rather than expressed constitutively. It has been 
shown for A. woodii, when growing heterotrophically in 
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Figure 9. Comparison of tine Tepidanaerobacter acetatoxydans fhs clusters to tine ftis gene clusters of Alkaliphilus metalUredigens, Clostridium 
carboxidivorans, C ljungdahlii, Thermosediminibacter oceani, and Eubacterium Umosum and to the coo cluster of Carboxydothermus 
hydrogenoformans and Rhodospirillum rubrum involved in CO oxidation. acsC/acsD, corrinoid iron-sulfur protein; cooC, Ni-insertion protein 
required for CODH maturation; metF, methylenetetrahydrofolate reductase; acsf, methyltransferase; acsB, acetyl-CoA synthase subunit; coo5/ 
acsA, CODH subunit; hyp, hypothetical protein; fchA, methenyltetrahydrofolate cyclohydrolase; folD, methylenetetrahydrofolate dehydrogenase; 
folP, dihydropteroate synthase; fhs, formyltetrahydrofolate synthetase; acoL, dihydrolipoamide dehydrogenase; cooF, Fe-S protein; cooM/K/UX/U/ 
H, six-subunit [NiFe] hydrogenase; Toce_0798, amidohydrolase; and hypDH, putative dehydrogenase. 
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coculture with a hydrogentrophic methanogen, that the 
electron flow is directed toward hydrogenases, evolving 
H2 directly consumed by the methanogens and bypassing 
the W-L pathway (Winter and Wolfe 1980). Likewise in 
P. productus, electrons are directed away from the W-L 
pathway when the CO2 partial pressure becomes low 
(Misoph and Drake 1996). Under these circumstances, 
the activities of FTHFS, methylenetetrahydrofolate dehy- 
drogenase, methenyltetrahydrofolate cyclohydrolase, 
methyltransferase, and methylene reductase are still essen- 
tial for the one-carbon metabolism. It is conceivable that 
the second fhs locus represents an alternative gene cluster 
supplying the cells with CI units at different oxidation 
levels, helping out when the first locus is becoming 
repressed. This hypothesis is supported by the fact that 
only proteins belonging to the methyl branch were found 
to be encoded by the second jhs locus, as analyzed and 
compared with E. limosum. Moreover, the three putative 
promoter sequences suggest more flexible up- and down- 
regulation compared with the first fits locus, which might 
enable the cells to respond to the demand of CI units at 
different oxidation levels in a more efficient way. 

In contrast, the fhs genes identified in S. schinkii and 
C. ultunense were found to be located separately from 
other genes of the W-L pathway, as is the case in M. ther- 
moacetica. Interestingly, the fhsl gene identified in C. ul- 
tunense was found to be mostly similar to that of purine- 
fermenting Clostridia, which do not harbor the W-L path- 
way genes, but instead use the reverse reaction of FTHFS 
to generate ATP during purine fermentation as the only 
energy source. In other words, this group recruits the same 
enzyme function as needed by SAOB, when running the W 
-L pathway in an oxidative direction. The operon structure 
described here distinguishes C. ultunense from M. thermo- 
acetica insofar as a putative regulatory protein might be 
encoded downstream of fhsl allocated by a putative helix- 
turn-helix motif known from DNA-binding proteins. 
Accordingly, palindromic sequences were found upstream 
of the promoter, suggesting a putative transcription activa- 
tor recognition site (data not shown). Unfortunately, to 
date, none of the genomes from purinolytic bacteria, 
which might have given us additional information, have 
been published. The second fhs gene identified in C. ultun- 
ense did not show any similarities to known acetogens. 
The difference in GC content and codon usage points 
clearly to a horizontal gene transfer event. Moreover, only 
fhsl was found to be expressed under both acetate-produc- 
ing and acetate-consuming conditions. A similar alterna- 
tive regulation mechanism as suggested for the T. 
acetatoxydans fhs clusters might be possible, but on the 
other hand, the differences in DNA character and codon 
usage might reduce or even prevent efficient transcription 
and translation. 



In both S. schinkii and M. thermoacetica, the respec- 
tive fhs gene seems to be associated with an operon 
encoding enzymes necessary for the de novo synthesis 
of tetrahydrofolate, the coenzyme of FTHFS. The S. 
schinkii fhs gene was found to be grouped closely to 
sulfate-reducing bacteria. These bacteria use the W-L 
pathway in the oxidative way when fermenting organic 
components and direct the electrons through the W-L 
pathway to sulfate and regain energy (Fig. 1). The gen- 
ome of the closest sulfate-reducing relative [Desulfoto- 
maculum orientis) encodes two fhs genes lying apart 
from each other and from other W-L pathway genes. 
It is possible that a putative second fhs gene cannot be 
detected in S. schinkii using this primer pair. Otherwise, 
the mRNA expression study showed the identified fhs 
gene as clearly expressed in both types of growing con- 
ditions. Interestingly, two large insertions were found 
that distinguished the FTHFS of S. schinkii and T. pha- 
eum clearly from other known FTHFS and connected 
them to the sulfate reducers. Based on the crystal struc- 
ture of FTHFS from M. thermoactica (Radfar et al. 
2000), these large insertions might dramatically alter the 
region, which is involved in tetramerization. All bacte- 
rial FTHFS purified to date have been reported to be 
functional as homotetramers (Scott and Rabinowitz 
1967; Brewer et al. 1970; MacKenzie and Rabinowitz 
1971; O'Brien et al. 1976; Marx et al. 2003). However, 
none of these has been isolated from a sulfate reducer. 
Thus, additional residues might change the quaternary 
structure toward a more stable tetramer or even a 
functional dimer affecting the overall enzymatic activity 
of the pathway Interestingly, another acetogen, A. Ion- 
gum, was found to be aligned within the sulfate reduc- 
ers (Fig. 3), closely related to the SAOB S. schinkii and 
r. phaeum. This species was first isolated from the gut 
of wood-feeding termites (Kane and Breznak 1991), but 
to date, none of the gut commensals have been 
reported to be syntrophic acetate oxidizers. 

Finally, it must be stated that the phylogenetic diversity 
observed here for FTHFS from the SAOB and the allelo- 
morphism demand reevaluation of the degenerated pri- 
mer pairs used in acetogenic community studies with 
respect to their specificity for SAOB and associated con- 
clusions regarding species abundance. 
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